The influence of speech rate on Fujisaki model parameters

نویسندگان

Hansjörg Mixdorff

Adrian Leemann

Volker Dellwo

چکیده

The current paper examines influences of speech rate on Fujisaki model parameters based on read speech from the BonnTempo-Corpus containing productions by 12 native speakers of German at five different intended tempo levels (very slow, slow, normal, fast, fastest possible). The normal condition was produced at an average rate of 6.34 syllables/s or 100%, the very slow version at 67%, and the fastest version at 161% of the normal rate. We extracted F0 contours and subjected them to decomposition using the Fujisaki model. We ordered all the data with respect to their actual speech rates. First, we assessed how prosodic realizations vary with speech rate and examined phrase command magnitudes, the number of phrase commands as well as the base frequency, accent command amplitudes, and the timing of accent command with respects to the underlying syllables and their nuclear vowels. Second, we analyzed between-sentence variability within and between speakers and investigated whether and how the prosodic structure is preserved at different speech rates. For very slow speech, we found for some of the speakers that the original phrase structure had disintegrated into something like a list of isolated words separated by pauses. Very fast speech became chains of uniform syllables at very high pitch and with almost flat intonation. With respect to the F0 range reflected by the amplitude of accent commands, we found strong interspeaker differences. While four of the subjects exhibited a significant reduction at higher speech rates, the others did not. As speed increases, it appears that F0 gestures commence earlier in the syllable, that is, the onset time of accent commands is located closer to the syllable/vowel onset than at lower speed. DOI: https://doi.org/10.1186/s13636-014-0033-6 Posted at the Zurich Open Repository and Archive, University of Zurich ZORA URL: https://doi.org/10.5167/uzh-103013 Published Version Originally published at: Mixdorff, Hansjörg; Leemann, Adrian; Dellwo, Volker (2014). The influence of speech rate on Fujisaki model parameters. EURASIP Journal on Audio, Speech, and Music Processing, 2014(33):online. DOI: https://doi.org/10.1186/s13636-014-0033-6 RESEARCH Open Access The influence of speech rate on Fujisaki model parameters Hansjörg Mixdorff, Adrian Leemann and Volker Dellwo

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the effects of speech rate upon parameters of the command-response model for the fundamental frequency contours of speech

A command-response model for the process of F0 contour generation has been presented by Fujisaki and his coworkers. The present paper describes the results of a study on the variabilty and speech rate dependency of the model’s parameters in utterances of a speaker of Japanese. It was found that parameters α and β can be considered to be practically constant at a given speech rate, while Fb may ...

متن کامل

A novel approach to the fully automatic extraction of Fujisaki model parameters

The generation of naturally-sounding F0 contours in TTS is important for the intellegibility and perceived naturalness of synthetic speech. In earlier works the author developed a linguistically motivated model of German intonation based on the quantitative Fujisaki model of the production process of F0. The extraction of parameters for this model from the extracted F0 contour, however, poses p...

متن کامل

Statistical Approach to Fujisaki-model Parameter Estimation from Speech Signals and Its Quantitative Evaluation

We have previously proposed a statistical model of speech F0 contours, which is based on the discrete-time version of the Fujisaki model. One advantage of this model is that it allows us to introduce statistical methods to learn the Fujisaki-model parameters from speech F0 contours. This paper proposes several modifications to our previous model and parameter inference algorithm, and quantitati...

متن کامل

Statistical Approach to Fujisaki - Model Parameter Estimation from Speech Signals

متن کامل

Hidden Markov Convolutive Mixture Model for Pitch Contour Analysis of Speech

This paper proposes a stochastic model of speech F0 contours, based on the stochastic formulation of the Fujisaki model. Our motivation for the stochastic formulation is twofold. Firstly, it allows us to derive a well-behaved algorithm for estimating the Fujisaki model parameters from a raw F0 contour. Secondly, it will open the door to incorporating the well-founded F0 contour model into vario...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

EURASIP J. Audio, Speech and Music Processing

دوره 2014 شماره

صفحات -

تاریخ انتشار 2014

The influence of speech rate on Fujisaki model parameters

نویسندگان

چکیده

منابع مشابه

On the effects of speech rate upon parameters of the command-response model for the fundamental frequency contours of speech

A novel approach to the fully automatic extraction of Fujisaki model parameters

Statistical Approach to Fujisaki-model Parameter Estimation from Speech Signals and Its Quantitative Evaluation

Statistical Approach to Fujisaki - Model Parameter Estimation from Speech Signals

Hidden Markov Convolutive Mixture Model for Pitch Contour Analysis of Speech

عنوان ژورنال:

اشتراک گذاری